Merging of opinions in game-theoretic probability 



Vladimir Vovk 
vovk@cs . rhul .ac.uk 
http : / /vovk . net 

February 1, 2008 



Abstract 

This paper gives game-theoretic versions of several results on "merging 
of opinions" obtained in measure-theoretic probability and algorithmic 
randomness theory. An advantage of the game-theoretic versions over the 
measure-theoretic results is that they are pointwise, their advantage over 
the algorithmic randomness results is that they are non-asymptotic, but 
the most important advantage over both is that they are very constructive, 
giving explicit and efficient strategies for players in a game of prediction. 



1 Introduction 

The idea that the predictions made by two forecasters will be come closer with 
increasing information goes back at least to de Finetti (|l937l Chapter V); see 
also Savage [195J, §3.6. De Finetti's assumption was that both forecasters com- 
pute their predictions from exchangeable probability measures that are not too 
close to a power probability measure; in detail he considered only binary predic- 
tion from this point of view. Later it became quite popular in Bayesian statistics 
since it renders an element of objectivity to the subjective probability measures. 

A similar phenome non is also kno wn in economics under the name of 
"Hotelling's law" Cafter iHotelli'^ Il929l) or the "principle of minimum differ- 
entiation" . 

The first general mathematical result about convergence of predictions 
made by successful forecasters appears to be Blackwell and Dubins's paper 
(196i). blackwell and Dubins's result was about infinite- horizon forecasting, 
whereas in this paper we will be interested in one-step ahead forecasting. (In 
game-theoretic probability, Blackwell and Dubins's setting is much less natu- 
ral, since there are no stochastic assumptions imposed on wh at happens out 
side t he protocol .) An importa nt paper in th is direction was Kabanov et al 



(^1977^ fsee also IShirvaevlll996l Sect. Vn.6, IJacod and ShirvaevI l2003l and 



' . ' II " i ' ' : : — I ' n 

Greenwood and Shirva ev 1985'), which gen e ralize d earlier results bv IKakutani 
(|1948[ ) and bv iHaiekl (|195S ) and iFeldmaJ (|l958l) . The main disadvantage of 



the approach of these papers is that it is "bulk", stated in terms of absolute 
continuity and singularity of probability measures. 
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One way to obtain pointwise results about merging of opinions is to use the 
algorit h mic t heory of randomness. The first result of this kind wa s pr oved by 
Dawid ( 19851 ) (who refers to it as "Jeffreys's law" in Dawid 1984 and Dawid 
2004 . Dawid's result was for his version of von Mises's notion of randomness 



based on subsequence selection rules, and so his notion of merging was rather 
weak . A resu lt based on the stand ard notion ofrandomness was obtained by 
Vovkl (|l987a[) and later extended bv lFuiiwaral (|2007[) . 

The algorithmic randomness approach has two major weaknesses. First, 
since it is based on the notion of computability, it imposes heavy restrictions 
on the types of measurable spaces it can deal with (typically one considers 
just finite or countable observation spaces fl). Second, it is asymptotic in the 
sense that it never provides us with explicit inequalities. The von Mises-type 
notion of randomness does not assert anything at all about finite sequences of 
observations. But even the modern definitions (such as those due to Martin-L6f, 
Levin, and Schnorr) are based on a notion (the deficiency of randomness) that 
is defined only to within an additive constant and so can only be applied to 
finite sequences en masse. 

The game-theoretic ap proach to probab il ity wa s suggested in Vovk (Il993 ) 
and developed in, e.g., bawid and Vovkl (| 19991 ) . IShafer and Vovkl (|200lh . 



Kumon et al.l (|2007r ). This approach makes it possible to use all flexibility 
of the algorithmic randomness approach without paying its high price; in 
particular, all our statements are either non-asymptotic or can be stated in a 
non-asymptotic manner. 



2 Merging of opinions as criterion of success 

Let be a measurable space and 'P(O) stand for the set of all probability 
measures on ft; elements of fl will be called observations and measurable subsets 
of fl will be called local events. Suppose we have two forecasters at each step 
issuing probability forecasts for the next observation uj„ G ^ to be chosen by 
reality. The game-theoretic process of testing the forecasters' predictions can 
be represented in the following form. 

Competitive testing protocol 

Players: Reality, Forecaster I, Sceptic I, Forecaster II, Sceptic II 
Protocol: 



FOR n = 1,2, . . .: 

Forecaster I announces G 'P(rJ). 

Forecaster II announces £ '^(^)- 

Sceptic II announces /^^ : — > [0,oo] such that / f^dP^^ — 1. 

Sceptic I announces : [0,oo] such that J f^dP^ — 1. 

Reality announces ujn € fi. 
JCi :=/C^_i/^(w„). 
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END FOR 

The predictions output by Forecaster I are tested by Sceptic I, and the predic- 
tions output by Forecaster II are tested by Sceptic II. The Sceptics' success in 
detecting inadequacy of the Forecasters' predictions is measured by their capital, 
1C\ and K}^, respectively. The initial capital is 1 and the game is fair from the 
point of view of the Forecasters. The value of K}^ (resp. K,]}) is interpreted as 
the degree to which Sceptic I (resp. Sceptic II) managed to discredit Forecaster 
I's (resp. Forecaster IPs) predictions. The requirement that the Sceptics choose 
functions and taking nonnegative values reflects the restriction that they 
should never risk bankruptcy by gambling more than their current capital. 

In our protocol we allow infinite values for the functions chosen by the Scep- 
tics and, therefore, infinite values for their capital. We will use the convention 
Ooo 0. 

Let a ^ {—1,1}. For two probability measures and P" on Q. we define 
the a-divergence between them as 

^(o) (pi II pll) _±_ _ ^(^I(^))i^(^II^^))i±^Q(^^)^ 

(with the same convention Ooo := 0) where Q is any probability measure on O 
such that ^ Q and P^^ <C Q, (3^ is any version of the density of P^ w.r. to 
Q and j3^^ is any version of the density of P^^ w.r. to Q. (For example, one can 
set Q := {P^ + P^^)/2; it is clear that the value of the integral does not depend 
on t he choice of Q, 0^ and /3".) The expression ([1]) is always nonnegative: see, 



e.g., Amari and Nagaokal (2QQ0). An important special case is the Hellinger 



distance, corresponding to a = and also given by the formula 

^(0) (pi II pii) ^ 2 (^y^ - 7^3^) ' Qiduj). 



(Sometimes "Hellinger distance" refers to i£'°(P\ P^^), as in lVovklll987al or to 



V'DO(pi,pii).) 

For simplicity, in the main part of this section we will only consider the case 
where Forecaster II is "timid" on the given play of the game, in the sense that 
he does not deviate too much from Forecaster I. Formally, Forecaster II is timid 
if, for all n, P/^ <^ P// (intuitively, if Forecaster II never declares a local event 
null unless it is already null according to Forecaster I) . It should be remembered 
that the assumption of timidity is always imposed on the realized play of the 
game rather than on Forecaster IPs strategy (in general. Forecaster II is not 
assumed to follow a strategy). 

The following asymptotic result will be proved in Sect. 2] (its counterpart in 
the algorithmic theory of randomness has been recently pr oved b y Fuiiwarall2007 



Theorem 3; in the special case a = it was obtained in IVovklll987al) ~We will 
say that Sceptic I (resp. Sceptic II) becomes infinitely rich if lim„^oo /Cjj = oo 
(resp. lim„^oo IC^ = oo). 
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Theorem la Let a G (—1,1). In the competitive testing protocol: 

1. The Sceptics have a joint strategy guaranteeing that at least one of them 
will become infinitely rich if 

oo 

Y,D^-^{Pl\\P'^)^^ (2) 

n=l 

and Forecaster II is timid. 

2. Sceptic I has a strategy guaranteeing that he will become infinitely rich if 

oo 

^i?(")(pi||pii)<oo, (3) 

n=l 

Sceptic 11 becomes infinitely rich, and Forecaster II is timid. 

Sceptic I (resp. Sceptic II) becoming infinitely rich in this theorem can also be 
understood as lim sup^^^^ /Cjj = oo (resp. limsup„^(^ /Cj/ = oo); in the next 
subsection we will see that this understanding of "infinitely rich" leads to an 
equivalent statement. 



Remark Fujiwara (|2007l Section 3.1) gives a simple example showing that 



Theorem [Ta] (namely, its Part[T|) cannot be extended to the case \a\ > 1. 

Before discussing the intuition behind Theorem Hal it will be convenient to 
introduce some terminology (in part informal). We will say that Forecaster I 
(resp. Forecaster II) is successful (for a particular play of the game) if Sceptic I 
(resp. Sceptic II) does not become infinitely rich. We say that a Forecaster is re- 
liable if we believe, even before the start of the game, that he will be successful. 
For example, the Forecaster might know the true stochastic mechanism pro- 
ducing the observations, or his predictions may be computed from a well-tested 
theory. 

Part[l]of the theorem says that either ([3]) holds or at least one of the Sceptics 
becomes infinitely rich. Therefore, if both Forecasters are reliable, we expect 
their predictions to be close in the sense of 

Suppose we only know that Forecaster I is reliable; for concreteness, let 
us impose on Reality the requirement that K}^ should stay bounded. If the 
Sceptics invest a fraction (arbitrarily small) of their initial capital in strategies 
whose existence is guaranteed in Theorem Hal we will have 

hmsup/Cj,! < c^) ^ ( £ D^''^ {Pi II pii) < J] . 

(In the context of algorit hmic random ness theory this eq uivalence is called a 
criterion of randomness in IVovklll987al and lFuiiwarall2007i ') 



In the following sections we will see several elaborations of Theorem[Tal One 
useful interpretation of our results is where Forecaster I computes his predic- 
tions using some well-tested stochastic theory, and we believe him to be reliable. 
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Forecaster II represents an alternative way of forecasting. We will be interested 
in the relation between the deviation of Forecaster II's predictions from Fore- 
caster I's predictions and the degree of the former's success, as measured by 
Sceptic II's capital. 



2.1 Equivalence of the two senses of becoming infinitely 
rich 

Let us first simplify the competitive testing protocol: 

Testing protocol 

Players: Reality, Forecaster, Sceptic 

Protocol: 

/Co := 1. 

FORn = 1,2,...: 

Forecaster announces F„ G 'Pi^)- 

Sceptic announces fn : fl [0,oo] such that / /„dP„ — 1. 
Reality announces ijj„ S ft. 
ICn := /C„_i/„(w„). 
END FOR 

Now we have only one Forecaster and one Sceptic. We again refer to /C„ as 
Sceptic's capital at time n. 

The proof of the following lemma will give an efficient procedure transform- 
ing a strategy for Sceptic into another strategy for Sceptic such that the second 
strategy makes Sceptic infinitely rich in the sense lim ICn = oo whenever the 
first strategy makes him infinitely rich in the sense lim sup /C„ = 00. If 5 is a 
strategy for Sceptic, P1LO1P2IV2 ... is a sequence of moves by Forecaster and Re- 
ality, and n € {1, 2, . . .}, let /C„(5, PiuJiP2iLJ2 ■ ■ •) be Sceptic's capital achieved 
when playing S against Forecaster and Reality playing P1UJ1P2UJ2 ■ ■ ■ ■ 

Lemma 1 For any strategy S for Sceptic there exists another strategy S' for 
Sceptic such that, for all P1UJ1P2UJ2 ■ ■ ., 

limsup/C„(5, PiaJiP2^2 . . .) = 00 lim ICn{S' , P1UJ1P2UJ2 ■ ■ ■) = co- (4) 

n — >oo 



Proof This proof will use the argument from lVovk and Shafei (|2005l) (the end 



of the proof of Theorem 3), which w e learned from Sasha Shen; for another 



argument, see IShafer and Vovkl (|2001l ). Lemma 3.1. 

Let S be any strategy for Sceptic. The transformed strategy S' works as 
follows. Start playing S until /C„ exceeds 2 (play S forever if /C„ never exceeds 
2). As soon as this happens, set 1 aside and continue playing S with the initial 
active capital ICn — 1 until the active capital exceeds 2. As soon as this happens, 
set another 1 aside (decreasing the active capital by this amount) and continue 
playing S until the active capital exceeds 2, etc. 

Formally, if at the beginning of some step n the capital ICn-i attained by 
S' includes active capital IC^l-^, with /C„_i — KJ^l^ set aside earlier, and S 
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recommends move /„, the move recommended by S' is 

l^act v^act 

Jn ■ 1^ Jn 'T' ^ , 

SO that Sceptic's capital becomes K,'!^\^n{^n)^(]^n-\—Kf!^1^ at the end of step 
n. Now (ID) follows from the fact that Sceptic will set aside another 1 infinitely 
often when the antecedent of ^ is satisfied. I 

2.2 General theorem about merging of opinions 

In this subsection we drop the assumption that Forecaster II is timid; unfor- 
tunately, this will make the statement of the theorem more complicated (it is 
easy to see that TheoremlTalitself becomes false if the assumption of Sceptic II's 
timidity is removed). Even the competitive testing protocol has to be modified. 
Let us say that (iJ^i?^^), where and E^^ are local events, is an exceptional 
pair for P\ P" e V{n) if P\E^) = 0, = 0, and 

P^(E) ^0^=^ P^\E) =0 

for all £; C \ (E^UE^^). 

Modified competitive testing protocol 

Players: Reality, Forecaster I, Sceptic I, Forecaster II, Sceptic II 

Protocol: 

ICl := 1. 

ICl^ := 1. 

FORn= 1,2,...: 

Forecaster I announces P^ G 'P(^l). 
Forecaster II announces P^ G 'P(f^). 

Reality announces an exceptional pair {Ej^, E^) for P^, P}^. 
Sceptic II announces f^-.fl^ [0,oo] such that / fj^dPj^ = 1. 
Sceptic I announces f^:Q~* [0,oo] such that J f^dP^^ ~ 1. 
Reality announces uJn G i^- 
K-i :=/Cj,_i/^(w„). 

END FOR 

The identity of the player who announces an exceptional pair does not matter 
as long as it is not one of the Sceptics. One way to chose {E^, E^^) is to choose 

and /3" first and then set E^ := {(3^ = 0} and i;" {/?" = 0}. 

Without the condition of timidity of Forecaster II, the condition of agreement 
([3]) between the Forecasters has to be replaced by 

oo 

J2 {Pi II Pn) < OO and Vn:u;r.iEiu E^l (5) 

n=l 

Theorem lb Let a G (—1, 1). In the modified competitive testing protocol: 
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1. The Sceptics have a joint strategy guaranteeing that at least one of them 
will become infinitely rich if is violated. 

2. Sceptic I has a strategy guaranteeing that he will become infinitely rich if 
^ holds and Sceptic II becomes infinitely rich. 



3 Non- asymptotic version 

In many cases it will be more convenient to use the following modification of 
the a-divergence ([1} between two probability measures and on f7: 

(pi II pii) := ^^In / (/3i(a,))^(/3"(^))^Q(dc.), (6) 

where the constant a is different from -I and 1; dI"! (P^ || P") wiU also be 
referred to as a-divergence. The expression ([B]) is nonnegative: this follows 
from the fact that H]) is nonnegative. When P^ and P^^ are close to each other 
(in the sense that 

\p\iu))'^{(3'\cu))'^Q{duj), (7) 



called the Hellinger integral of order is close to 1), the ratio of ([6]) to H]) 
is close to 1. In any case, the inequality In a; < a; — 1 (for x > 0) implies that 

|a| < 1 =^ (P^ II P") < P>["1 (P^ II P") 
|a| > 1 ^ (Pi II P") > (Pi II P") . 

In principle, it is possible that D'^") (P^ || P^^) ~ oo: this happens when the 
Hellinger integral in ^ is zero (for |a| < 1) or infinity (for |a| > 1). 

Remark The version Q coinc ides, to within a constant factor and reparam- 
eterization, with Renyi's (jl96lD information gain, which in our context can be 
written as 

(PI,P") := ^log / {p\uj)np'H.uj)y-"Q{dcu), a>0, a^l, (9) 

log standing for the binary logarithm. However, we will never use the definition 
([9|) in this paper; an important advantage of ([6|) is that, for any constants ai 
and a2, the ratio of the divergences D^^^^^iP^ \\ P") and D^"^\P^ \\ P") (as well 
as the divergences D'^"^\P^ \\ P") and (pi || P")) is close to 1 for P^ and 

P^^ that are close to each other (in the sense of f3^/f3^^ « 1). 

The main result of this section is the following non-asymptotic version of 
Theorems [la] and [Tbl 

Theorem 2 In the competitive testing protocol: 
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1. For any a G M, a ^ {^Ij I}; Sceptics have a joint strategy guaranteeing 
that, for all N , 



N 



In/Clv + ln/C]i ^ II Pi') ■ (10) 

1 + a 1 — a ^ — ' 

n— 1 

2. For any a G (— oo, —1), Sceptic I has a strategy guaranteeing, for all N , 

In/Cjv + ln/C]i < DH (P/, |1 P^^) . (11) 

1 + a 1 — a 



ra=l 



We regard (|10p and (fTT|) to be true if their left-hand side is an indefinite expres- 
sion of the form oo — oo. 

Part [1] of Theorem [2] is analogous to Part [T] of Theorem [Tal We will only 
be interested in the inequality ">" of (fTO| for a £ (—1,1) (for such as both 
coefficients and are positive). By ([8|) this inequality then implies 



^ In/Cjv + ^ In/C^^ > J: {Pi II 



which is a precise quantification of Part [T] of Theorem [Tal 

Part [2] of Theorem [2] greatly strengthens the inequality "<" of pO)) in the 
case a < —1. Not only can this inequality be attained when the Sceptics 
collaborate with each other, but Sceptic I alone can enforce it, even if Sceptic 
II plays against him. It is close to being a quantification of Part [5] of Theorem 
[Tal There is, however, an essential difference between Part [2] of Theorem [51 and 
Part[21of Theorem [Tal a < — 1 in the former and a E (—1, 1) in the latter. 

By ([8]), inequality ([TT]) wiU continue to hold if D^"! i s replaced by An 
important special case is where a = —3 (considered in Vovk|[l987a . Theorem 
1); the (— 3)-divergence becomes the distance 



2 in /3"H 



in the notation of ([T]) and assuming /3^^ > 0; there is no coefficient ^ in IVovk 



1987a) 



In t he re st of this section we will prove Theorem [2l mainly following IVovk 



(|l987al ) and iFuiiwa ra (2007). There are, however, two important differences. 
First, our argument will be much more precise as compared to the 0(1) accuracy 
of the algorithmic theory of randomness. Second, we will pay careful attention to 
the "exceptional" cases where = or (3^ = 0; this corresponds to getting rid 
of the assumptio n of local absolute continuity in measure-theoretic probability 



(accomplished bv iPukelsheim 19861 ) 
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3.1 Proof of Part [T] of Theorem d 

Let Sceptic I play the strategy 



.1 {P'rl/PiY 

J n • 



= (f)^exp(i^i^H(^i|,^n)j (12) 

(the HeUinger integral in the first denominator is just the normalizing constant) 
and Sceptic II play the strategy 

J n ■ 



l + Q 



From 



exp 



(/^)^(/^^)^ 

- exp {Pi II P'^) + ^/^["l {Pi II Pi')) 

= exp(i?H (piji^ir 



we now obtain ^TU\i . 

Let us now look more carefully at the case where some of the denominators 
in or (fT5|) are zero and so the above argument is not applicable directly. If 
the HeUinger integral ([7]) at time n, 

iPiiu;))'^{Pl!{u;))'^Qidu;), (14) 

is zero, the probability measures and Pj^ are mutually singular. Choose a 
local event E such that Pl{E) = P"(f^ \ £:) = 0. If the Sceptics choose 

I joo ifujeE II Jl ifujEE 

I 1 otherwise, I oo otherwise, 

(fTU|) will be guaranteed to hold: both sides will be oo. 

Let us now suppose that the HeUinger integral ([H]) is non-zero. In and 
([13]), we interpret 0/0 as 1 (and, of course, i/0 as oo for t > 0). As soon as the 
local event 

{(3l = & /3^^ > 0) or {pl > & /3^^ = 0) 
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happens for the first time (if it ever happens), the Sceptics stop playing, in the 
sense of selecting = /j^ := 1 for all TV > n. This will make sure that (flO)) 
always holds (in the sense of the convention introduced after the statement of 
the theorem). 



3.2 Proof of Part [2] of Theorem [2] 

Fix a < — 1 and consider two strategies for Sceptic I: the one he played before, 
p^ . and the strategy 



fi = (15) 

Investing a fraction c e (0, 1) of his initial capital of 1 in strategy ((T2)) and 
investing the rest, 1 — c, in strategy Sceptic I achieves a capital of 



c 



W / oil \ ^ / , 2 ^ \ ^ flII 

„=1 V/^n/ \ n=l / n=l ^" 

To get rid of the likelihood ratio a; := Y[n=ii(^n / Pn) : we bound (fT6|) from below 

by 

inf (^cx^ exp (^i^ (P^ || P//)^ + (1 - c)IC'ix^ (17) 

/ . ^ 2 , l + c« \ 



-1 - a 



1+Q 



c— (1-c)-— (18) 



X E II ^")) C^n)'^ ■ (19) 

To optimize this lower bound, we find c such that 

2 l + o 

ci-°(l — c) — > max, 

which gives 

2 

c = 



1 — a 

For this value of c, the expression on line (flSl) equals 

/ 2 1 + n \ 2 

-l-a\i-° / 2 \ \ / 2 \i-"/-l-a 



— 1 — ay /Vl — a/ Vl — a 



= 1. 



In combination with (|T9l) this gives 

AT 



n=l 
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Taking the logarithm of both sides and regrouping, we obtain PT|) . 

It remains to consider the exceptional cases. If the Hellinger integral (|14p 
is infinite (this happens when is not absolutely continuous w.r. to P^), the 
right-hand side of pip is also infinite, and there is nothing to prove. Suppose 
is finite. When = 0, we will interpret the expressions and as oo 
(this will not affect / /^dPj^; in principle, it is now possible that / /^dP/j < 1, 
but this can only hurt Sceptic I). When (3l^ > k = 0, we, naturally, 
interpret (|12p as oo. In both cases Sceptic I's capital becomes infinite when 
/3I = or /3^i = 0, and ^ stiU holds. 



4 Proof of Theorems [Ta| and llbl 

Part[T]of Theorems [Tal and 1 1 bl immediately follows from Part[l]of Theorem[5](in 
the case of Theorem [Tbl the Forecasters' moves and /^^ have to be slightly 
redefined by setting them to cxd on and E^^, respectively). Therefore, in this 
section we will only prove Part [2] of Theorems [Ta| an dfTbl Instea, d of deducing 



this r esult from Part [2] of Theorem [2] (as was done in I\^klll987al and lFuiiwaral 



20071 ). we will prove it using methods of the theory of martingales and adapting 



the proof given in Shiryaev (Il996|. Theor e m VII . 6. 4). 



The following lemma from Fuiiwara ( 2007t) shows that all a-divergences. 



a e (—1, 1), coincide to within a constant factor. We will be using the words 
"increasing" and "decreasing" in the wide sense (e.g., a constant function qual- 
ifies as both increasing and decreasing). 

Lemma 2 Let P',P" G V{n). The function (1 - a)D^°'\P' , P") is decreasing 
in a £ {—1, 1). The function (1 + a)!?*-"-' (P', P") is increasing in a £ (—1, 1). 

Proof See the proof of Lemma 10 in Fujiwara ( 200?! . Appendix B). I 



Therefore, we could restrict ourselves only to t he Hellinger distance. We will, 
however, consider the general case (as lFuiiwarai r2007). 

We will be concerned with the following slight modification of the testing 
protocol. 

Semimartingale protocol (multiplicative representation) 
Players: Reality, Forecaster, Sceptic 
Protocol: 
/Co 1. 

FOR n = 1,2, . . .: 

Forecaster announces P„ € P{^)- 

Reality announces measurable ^„ : f2 ^ M. 

Sceptic announces /„ : ^ [0, 00] such that / /„dP„ — 1. 

Reality announces (jj„ G fi. 

/C„ := /C„_i/„(w„). 
END FOR 

The martingale protocol (resp. suhmartingale protocol^ supermartingale protocol) 
differs from the semimartingale protocol in that Reality is required to ensure 
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that, for all n, the functfon ^„ is P„-integrable and /^,idP„ is zero (resp. non- 
negative, nonpositive). 

An event is a property of the play {Pn,^n, fm^n)'^=i- We say that Sceptic 
can force an event E if he has a strategy guaranteeing that either E holds or 
K-n ^ oo as n ^ oo. Lemma [T] shows that replacing /C„ — s- oo by linisup„ /C„ = 
00 gives an equivalent definition. 

Another representation of the semimartingale protocol is: 

Semimartingale protocol (additive representation) 
Players: Reality, Forecaster, Sceptic 
Protocol: 
/Co := 1. 

FORn= 1,2,...: 

Forecaster announces P„ G 'P{^). 
Reality announces measurable : — s- R. 

Sceptic announces : ^ [— /C„_i. oo] such that / gndPn = 0. 
Reality announces ujn & ^■ 

END FOR 

The correspondence between the two representations of the semimartingale pro- 
tocol is given by gn — (/„ — 1)/C„_i. We will switch at will between the two 
representations. 

If Sceptic follows a strategy S, we will let /Cf stand for his capital at the 
end of step n (as a function of Forecaster's and Reality's moves). The additive 
representation makes it obvious that Sceptic's strategies can be mixed: 

Lemma 3 If Si,S2, ■ ■ ■ is a sequence of strategies for Sceptic and pi,p2, . . . is 
a sequence of positive weights summing to 1, there is a "master" strategy S for 
Sceptic ensuring 

oo 

fe=i 

Proof It suffices for Sceptic to set gn := 'Y^^=iPkgk,n at each step n, where 
gk,n is the move recommended by 5^. I 

Lemma 4 In the martingale protocol, Sceptic can force 

oo „ N 
5Z / ^n^-^" < oo ^ sup ^ C„(w„) < oo. 

n— 1 ^ n—1 



Proof It is easy to see that for each C > there is a strategy, say Sc, for 
Sceptic leading to capital 
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for all iV = 1, 2, . . . . It remains to mix all Sc, C — 1,2, . . ., according to Lemma 
[3] (with arbitrary positive weights). I 



Lemma 5 In the submartingale protocol, Sceptic can force 
X! ( / indPn + / CndP„ j < oo =^ sup ^ ^ 

n—1 ^ / ^ n—1 



) < OO. 



Proof It suffices to apply Lemma S] to '■— £,n — J CndPn (notice that 

/e^dp„</€^dp„). I 



Let Ie stand for the indicator of local event ECU: 
IeH := 



1 if w e £; 

otherwise. 



The following lemma is a version of the Borel-Cantelli-Levy lemma. 
Lemma 6 In the submartingale protocol, Sceptic can force 

( Vn : ^„ = Ifi^ & Pn{En) < OO ] =4> {uin G En for finitely many n) . 

V n=l J 

Proof This is a special case of Lemma [5l I 

Our application of the Borel-Cantelli-Levy lemma will be made possible by the 
following lemma, which we state using the notation introduced earlier in ([T]). 

Lemma 7 For each a G (—1,1) there exists a constant C — C{a) such that, 
for all and P", 

Pi {P^ > e/3"} < (Pi II P") . 

Proof Let E stand for the event {(3^ > e/?^^}. Since 

(Pi II P") = (^1 - |^(/3i(a;))^(/3"(a-))^Q(dc.) 



l-a'Jn 2 ' ^ ' 2 
l-c? Ie 2 ■ 2 



1 - a2 ./p; 2 



4 / 1 — a 1 + a „i±£.\ nT/rnx 
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(the first inequality follows from the fact that the geometric mean never exceeds 
the arithmetic mean, and the second inequality uses the fact that for each /? > 
the function 

1 - a 1 + a i+a 

is decreasing in x G [0,0/e], which can be checked by differentiation), we can 
set 

(the expression in the parentheses is a positive function of a G (—1, 1) since it 
takes value at a = —1 and a = 1 and the function is strictly concave). I 

We will also need the following elementary inequality, where : M ^ R is 
the truncation function 




X if a; < 1 
1 otherwise. 



Lemma 8 For each j £ (0, 1) there exists B > 1 such that, for all a; > 0, 

xU{lnx) + xU'^{\nx) < B{x - 1) + (1 - a;^) . (20) 

7 

Proof Let us first consider the case x < e. To sec that 

xlnx + xln^xK B{x - 1) H (1 - x'^) 

7 

notice that the values and the first derivatives of the two sides of this inequality 
coincide when a; = 1 (in fact, the coefficient was chosen to match the 
derivatives) and that the inequality for the second derivatives, 

3 + 21nx {B-l){l-j) 
X - ^ ' 

holds when B is sufficiently large (namely, when B > 5e^~'^/{l — 7) + 1). 
In the case x > e, the inequality becomes 

2a; < B{x - 1) + (1 - a;^) ; 

7 

since it is true for x = e (by the previous paragraph), it suffices to make sure 
that the inequality between the derivatives of the two sides holds: 

2<B - {B-l)x^-'^. 

This can be achieved by making B > (2e — e'^)/{e — e'^) > 2. I 
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Now we have all we need to prove Part [5] of Theorem [Tb] Let us first assume 
that the functions and are always positive and that = E^^ = 0, for all 
n. Our goal is to prove that Sceptic I can force 



n=l 

Substituting 

7 : 



^ {Pi II Pi') < oo ^ hminf /C,V < oo. 
1 - a _ Pi 



2 ' /3^i 

in (|20p . multiplying by f]^, integrating over Qn, and summing over n = 1, 2, . . ., 
wc obtain 

(,.|).,la»(i„|)dft. 

< "'~''"+°' i:g'"(p.;iip,;'). (21) 

n=l 

Let us combine this inequality with: 

• Lemma[5]applied to Sceptic 1 and ■— U (in • It is applicable because 
the inequality xU{hix) > x — 1, valid for all x > 0, implies 



/M^^; = /|^(.n|)A-.-do,.>/(|-i)^; 



dQ„ = 



(and is P^-integrable since ^„ < 1). 



• Lemma [S] applied to Sceptic I and i?„ := {/3^ > It will be applicable 

by Lemma [T] 

We can now see that it suffices to prove that Sceptic I can force 

(sup V U (in < eplliiJn) from some n on ) 

=^ liminf/Cj,^ < oo. (22) 

n — ^oo 

Forcing ([22]) can be achieved by forcing 

sup y In $r7^ < oo =^ hm inf JC^ < oo, (23) 
and the latter can be done with the strategy 

rl ._ ^ yll 
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It remains to get rid of the assumption (3^^ > 0, > 0, Ej^ — E^^ = 0. Our 
argument so far remains vahd if the observation space Q = S7„ is allowed to 
depend on n (it can be chosen by Sceptic I's opponents at any time prior to his 
move). In particular, we can set ri„ := \ {Ej^ U E^) and assume, without loss 
of generality, that > and f3ll > on 17„. This proves Part 2 of Theorem 

m 

To deduce Part 2 of Theorem [Ta| from Part 2 of Theorem llbl notice that, 
for all n, Pj,iE^^) = (as < P^^). Therefore, P^{El^ U E}^) = 0, and mixing 
(in the sense of Lemma [3]) any of the strategies for Sceptic I whose existence is 
asserted in Part 2 of Theorem [lb] with the strategy 



00 if w e E}^ U Ell 

1 otherwise 



we obtain a strategy for Sceptic I satisfying the condition of Part 2 of Theorem 

M 

Remark Part[2]of Theorem [5] gives a precise lower bound for the rate of growth 
of Sceptic I's capital in terms of the rate of growth of Sceptic IPs capital and 
the cumulative divergence between the Forecasters' predictions. Unfortunately, 
our proof of Part [2] of Theorems [Ta| and [lb] does not give such a bound; the 
step that prevents us from obtaining such a bound is replacing with (|23p 
(another such s tep would have been the use of Doob's martingale convergence 
theorem, as in Shirvaev 19961 but this proof avoids it). It remains an open 



problem whether such a bound exists. 



5 The rate of growth of Sceptic II's capital in 
terms of the Kullback— Leibler divergence 

Suppose Forecaster I is reliable and Sceptic I invests a part of his capital in 
strategies whose existence is asserted in Theorem [51 That theorem then gives 
rather precise bounds for the achievable rate of growth of JC^: Sceptic II can 
achieve 

In/C]^ > ^ E II p!!) Oil) (24) 

n=l 

but cannot achieve more than that: 

ln/C]i < ^ E {Pi II Pii) + 0(1). (25) 

n=l 

The problem with this is that p4)) and (|25p refer to different as: a G (—1, 1) in 
([24]) and a < — 1 in (|25l) . In this section we will derive similar (albeit cruder) 
bounds for the same a-divergence, namely for a = —1, one of the two cases 
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that have been excluded so far. The (—1) -divergence, or the KuUback-Leibler 
divergence, is defined by 



using the notation of ^ . Somew hat related r esults have bee n obta i ned i n 
algorithmic randomness theory by ISolomoiiofi ( 1978[ ): see also Vovk (|l989l) . 
Theorem 2.2. 

Remark It might be tempting to set a < — 1 in ^TU\i . This will not lead to any 
useful bounds because of the possibility /Cj^ — > 0. 

For simplicity, we will again impose the assumption of "timidity" (in a much 
stronger sense than before) on Forecaster II: on the given play of the game, 
his predictions should stay within a constant factor of the reliable Forecaster 
I. More precisely. Forecaster II is c-timid, for a constant c > 1, if for all n the 
ratio Pn/Pn (with 0/0 interpreted as 1) is bounded above by c and bounded 
below by 1/c. The value of the constant c is not disclosed to the players, and 
the strategies for the Sceptics constructed in this section will never depend on 
c. 

Let stand for max(x,0). 

Theorem 3 Let c > 1. There is a constant C > depending only on c such 
that: 

1. The Sceptics have a joint strategy in the competitive testing protocol that 
guarantees, starting from N = 3, 

ln/C]i > ^ {Pi II Pi') - C^:^^^ (ln+ /Cjv + InlniV) (26) 

71—1 

on the plays where Forecaster II is c-timid. 

2. Sceptic I has a strategy that guarantees, starting from N = S, 

^ I N 

In/Cji < J2 D^^'^ (Pn II Pn) + Cy (ln+ /C]v + InlniV) (27) 

n=l 

on the plays where Forecaster II is c-timid. 

Before proving Theorem [31 we will state a similar result for the case where the 
duration of the game, N, is known in advance. (Theorem [3] is applicable in this 
case as well, but it can be made more precise.) 

Proposition 1 Let c > 1 . There is a constant C > depending only on c such 
that: 
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1. For each N € {2, 3, . . the Sceptics have a joint strategy that guarantees 

N 

ln/C]i > 5] {Pi II Pii) - (^/iV - l) (ln/C]v + C) (28) 

n=l 

on the plays where Forecaster II is c-timid. 

2. For each N d {1,2, . . .}, Sceptic I has a strategy that guarantees 

N 

ln/C]i < (Pi II P//) + (ViV + l) (ln/C]v + C) (29) 

n=l 

on i/ie plays where Forecaster II is c-timid. 

The intuition behind Theorem [3] and Proposition [1] is that Sceptic II can 
achieve the growth rate of his logarithmic capital close to the growth rate of 
the cumulative KuUback-Leibler divergence between P^ and P//, but cannot 
achieve a better growth rate. Theorem [3] is related to the law of the iterated 
logarithm, in that it gives the accuracy of 0(V-/Vln In TV) for a reliable Forecaster 
I, whereas Proposition [1] is related to the central limit theorem, in that it gives 
the accuracy of 0{^/N). 

5.1 Proof of Part [1] of Theorem [3] and Proposition [1] 

Substituting a := —1 + 2e, with e G (0, 1), in (jTU]) gives 

n— 1 

which can be rewritten as 

lnC = -if]lny' (^^) dpi-i^ln/Cjv. (30) 

2 + 

The inequahty < 1 -\- x -\- ^^e^ gives 

^ 1 I 1 /^ri 1 2 £ 1 2 

< 1 + eln — J- + -e c In c. 
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This further impHes 



N 



" n=l ^ 



Pi' 
Pi. 

>Af2 1- (l - ^D^-'^ K II P^) + \^'o^ (31) 

n— 1 ^ ' 

^ -7 E (-^^'"^ (^^ II Pn) + r'c^ (32) 

ri=l ^ ^ 

= {P^\\K')-lNecnn\ 

n=l ^ 

(the transition from pTjl to ([5^ uses the inequahty In a; < x — 1, vahd for 
a; > 0; the expression in the parentheses in (j3ip is nonnegative because of its 
provenance). Plugging the last inequality into we obtain 

ln/C]i > ^ (Pi II P/J) - -Nec^ In^ c - ^ In/Cjv (33) 

n— 1 

If TV is known in advance (but c and /C]^ ^"^^ not), we can set e := N^^l"^ in 
(|33|) . which gives 



In^" 



^ > ^ i^(-i) (P^ II P^i) - \/Nc^-'"- W c - (V^V - l) In/C^,, 



this proves (|28p with, e.g., C :~ 2c In'^ c (in fact, we can take C arbitrarily close 
to \ In^ c if we only want to hold for sufficiently large A^). 

In the case of Theorem [51 where iV is unknown, we will use the discrete 
form (|Vovklll987bl Theorem 1) of Ville's (fToSQ) method of proving the law of 



the iterated logarithm (however, without worrying about constant factors). The 
method is based on the following simple corollary of Lemma [31 If ^2, ^3, . . . is a 
sequence of strategies for Sceptic and p2 , Pa , • • ■ is a sequence of positive weights 
summing to 1, there is a strategy S for Sceptic ensuring In/C^ > In/C^'' + Inp^ 
for all /c = 2, 3, . . . . In particular, taking oc fc~^, k = 2, 3, . . ., we obtain 
In/Cfr > In/C^" - 21nfc for k>2. 

Let 62,63, .. . be a sequence of positive numbers, to be specified later on. 
Since for each /c = 2, 3, . . . the Sceptics can ensure (I33p for e := 6fc (using simple 
strategies ([T ^ ^ P^ depending only on the Forecasters' predictions), they can 
also ensure, for all A; = 2, 3, . . ., 

N , _ 

In /C^ + 2 In A: > ^ D^-^^ (P^ || P//) - CiiV6fe (in /C]v + 2 In k) (34) 

n=l 
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where Ci (as well as C2 to Cq below) is a constant depending only on c. Weak- 
ening (p4)) to 



^ 1 
ln/C]i > ^ (pi II P^) C^Ne, - In/Cjv 

1 ^fc 



21nfc 



and setting 



Ink 

k:=\\nNl — 



(35) 



(so that Efc coincides with y^lnln N/N to within a constant factor), we further 
obtain 



N 



In/C^^ > i^n II - CaViVlnlnTV-Ca 



for > 3. This essentially coincides with 

Remark The strategies for Sceptic II constructed in our proof of Part [1] of The- 
orem [3] and Propositio n[l]was somewhat c omplex, especially in the case of The- 
orem |3l In the spirit of lSolomonofn ( 19781 ). we could take the simple "likelihood 
ratio" strategy := Pj^/P^. The expected value of \nf^ with respect to is 
£)(-!) (pi II P^)^ and according to standard results of game-theoretic probability 
Sceptic I can become infinitely rich unle ss In/Cl^, y.^_, D^-^) (Pl \\ P}}). The 
law of the iterated logarithm (see, e.g. 



Shafer and Vovk '2001, 



Chapter 5) will 

give a result similar to Part [T] of T heorem [3l and the weak law of large numbers 
(in the f orm of Proposition 6.1 in IShafer and Vovkl |200l|) or the central limit 
theorem (Shafer and Vovkll200ll Chapters 6-7) will give results similar to Part 
[1] of Proposition [1] An advantage of the proofs given in this subsection is that 
they show the dependence of Sceptic IPs capital on Sceptic I's capital; it re- 
mains to be seen whether such explicit dependence can be achieved using limit 
theorems of game-theoretic probability. 



5.2 Proof of Part [2] of Theorem [3] and Proposition [T] 

Substituting a := — 1 — 2e, with e > 0, in (fTTjl gives 

_i In/Cjv + ^ In/Cj^ < {Pi II P^i) , 

n— 1 

or, equivalently, 

ln/C5^<if:in| (^ydPi-fi±^ln/C5,. (36) 
Analogously to the transition from (|30p to ([55]) but now using 

(l^n\ ^ T , 1 /^n , 1 2 £ 1 2 
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and 



2 

n=l 

we obtain 



N 



Tl=l 



If N is known in advance, setting e :— N ^1'^ gives 



TV 



In/Cji < ^ (Pi II Pii) + + In/Cjv) (VtV + 1 

n=l 

and so completes the proof of Proposition [T] (We can take C := cln^ c in (HH), 
or, if we are interested in (|^^ holding from some N on, C ~ \ \y? c.) 
As for Theorem 131 we now have 



\^K}^<Y. D(~'^ (Pi II Pii) + iNe.c^" In' c + {inlC'^ + 21nfc) . 

n=l ^ 

Setting, as before, we now obtain 

N 

lnC<E^^"^ (^^11^") 



n=l 



+ CsV^lnlniV + Ce + (ln+/C]v + 21nlnAr) , 

which completes the proof of Theorem [3) 

6 Criteria of absolute continuity and singularity 

A simple measure-theoretic counterpart of the competitive testing protocol is the 
measurable space r2°° equipped with two probability measures, P^, P^^ g 7^(17°°). 
The generic element of f2°° will be denoted aJiW2 ■ ■ • • Let P\ (resp. P^^) be a 
regular conditional distribution of tj„ given wi . . . w„_i w.r. to the probability 
measure P^ (resp. P^^). (For regular condition al distributions to exist it suffices 



to assume that f2 is a Borel space: see, e.g., IShirvaevlll996l Theorem II. 7. 5.) 



The strategies of the two Forecasters are fixed: Forecaster I is playing P\ and 
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Forecaster II is playing P^; therefore, they cease to be active players in the 
game. 

We will consider the filtration {J-'n)'i^^Q where each is generated by 
wi , . . . , and sometimes write T for Too ■ By a normalized nonnegative 
measure-theoretic martingale w.r. t o a probability measure P on 17°° we will 
mean a martingale (Cn)J^Lo (see, e.g., Shirvaev 19961 Chapter VII) w.r. to P and 



(•^n)^o s^ch that ^0 = 1 and ^„ > for all n = 1, 2, . . .; we will allow ^„ to take 
value oo (of course, with probability zero). We will usually write ^(wi, . . . ,ujn) 
for ^„(ci;i, [^2, ■ • •) and regard ^ as a function on the set fi* of all finite sequences 
of elements of fl. A normalized nonnegative game-theoretic martingale w.r. to 
P (either P^ or P^^) is Sceptic's (either Sceptic I's or Sceptic II's, respectively) 
capital represented as a function of Reality's moves uji . . .uJn- 

To state the connection between measure-theoretic and game-theoretic no- 
tions of normalized nonnegative martingale (in the current measure-theoretic 
framework), let us say that two processes (i.e., measurable functions on fl*) 
are equivalent (w.r. to P) if they coincide P-almost surely. It is customary 
in measure-theoretic probability to identify equivalent processes (although this 
practice does not carry over to the game-theoretic framework) . It is easy to see 
that the normalized nonnegative measure-theoretic martingales are the closure 
of the measurable normalized nonnegative game-theoretic martingales w.r. to 
this relation of equivalence. 

The following proposition is a special case of the i nfinitary version of Ville's 



theorem; for a proof see, e.g. JShafer and Vovkl (|2001[ ). Proposition 8.14 



Proposition 2 Let E e T and P he either P' or P". 

1. If a normalized nonnegative measure-theoretic martingale diverges to in- 
finity when E happens, then E — Q. 

2. IfV E = 0, then there is a normalized nonnegative measure-theoretic mar- 
tingale that diverges to infinity if E happens. 

This proposition is also true for the measurable normalized nonnegative game- 
theoretic martingales and establishes the connection between the game-theoretic 
and measure-theoretic notions of a "null event" : for example, an event (in the 
measure-theoretic sense, i.e., a measurable subset of E satisfies F^{E) = 
if and only if Sceptic I has a measurable strategy that makes him infinitely rich 
on E. 

The following is a special case of the Kabanov-Liptser-Shiryaev ( 19771 ) cri- 



terion of absolute continuity and singularity. For simplicity we assume that 
for all n (this is the standard assumption of local absolute con- 



tinuity, which simplifies measure-theoretic results: see, e.g.. lJacod and Shirvaev 



20031 Sect. IV. 2c). This will ensure the timidity of Forecaster II (at least after 
changing the regular conditional distributions on a set of probability zero, both 
under P^ and P"). 

Corollary 1 In the measure-theoretic competitive testing protocol: 
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1. For any a G (—1, 1), <C F^^ if and only if ^ holds ¥^-almost surely. 

2. For any a G (-1, 1), ± P" if and only if (0j holds ¥^-almost surely. 

Proof Wc start from Part "if" of Part [l] Suppose © holds P^-almost surely 
and E is an event such that P^^(£') = 0; our goal is to prove that P^(£') — 0. Let 
Sceptic II play a measurable strategy that makes him infinitely rich on the event 
E, and let Sceptic I play the half-and-half mixture (in the sense of Lemma of 
the following two strategies: one of the measurable strategies whose existence 
is guaranteed in Part[2]of Theorem Hal (cf. the remark following this proof) and 
a measurable strategy that makes him infinitely rich when the event ^ fails 
to happen. It is easy to check that Sceptic I is guaranteed to become infinitely 
rich on the event E, no matter whether ([3]) holds or not. Therefore, indeed 
]P\e) = 0. 

Next we prove Part "only if" of Part [T] Fix measurable strategies and 
S^^ for the Sceptics that make at least one of them infinitely rich when the 
event Q fails to happen; Part [T] of Theorem [Tal guarantees that such strategies 
exist. Sceptic II will play strategy S^^ whereas Sceptic I will play a mixture of 

and another strategy. Let E be the event that Sceptic II becomes infinitely 
rich. Since P"(S) = and P^ < P", we have P^(£;) = and so Sceptic I has 
a measurable strategy that makes him infinitely rich on E; let him play the 
half-and-half mixture of this strategy and S^. This strategy for Sceptic I will 
guarantee his becoming infinitely rich whenever ^ fails to happen. 

The proof of Part "if" of Part again relies on Part [T] of Theorem [Tal Define 

and S^^ as before, let Sceptic II play and let Sceptic I play the half-and- 
half mixture of and a measurable strategy that makes him infinitely rich 
when ([2]) fails to happen. It is clear that one of the Sceptics becomes infinitely 
rich no matter what happens. Therefore, P^ and P^^ are mutually singular (e.g., 
take E as the event that Sceptic I becomes infinitely rich; then ¥^{E) — and 
¥^\n°°\E) =0). 

It remains to prove Part "only if" of Part [2l Let E be an event such that 
¥^^{E) = and ¥^{E) = 1. Let Sceptic II play a measurable strategy that makes 
him infinitely rich on E and let Sceptic I play the half-and-half mixture of the 
following two strategies: one of the measurable strategies whose existence is 
guaranteed in Part [5] of Theorem [Tal and a measurable strategy that makes him 
infinitely rich when the event E fails to happen. Now if the event ([2|) fails to 
happen. Sceptic I is guaranteed to become infinitely rich, regardless of whether 
E happens. This completes the proof. I 

Remark Notice that to deduce Corollary[l]we need slightly more than stated in 
TheoremjTal namely, we need measurable strategies for the Sceptics (in the case 
of Part[T]) or Sceptic I (in the case of PartH]). It is easy to see that the strate- 
gies constructed in Theorem [Ta| satisfy this property, but it would have been 
awkward to include their measurability in the statement. The strategies are not 
only measurable but also computable, which is a much stronger property. There- 
fore, it would have been more natural to include the strategies' computability 
in the statement of Theorem Hal however, this would significantly complicate 
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the exposition, especially that there are several popular non-equivalent defini- 
tions of computa b ility, even for real- valued functions of real variable. Following 
IShafer and Vovk ( 200l[ ). we dropped any references to the properties of regu- 



larity of the constructed strategies for the Sceptics in the formal statements of 
our results. 

To discuss the intuition behind Corollary [2 let us assume that Forecaster I 
is reliable in the following measure-theoretic sense: we do not expect an event 
E to happen if it is chosen in advance and satisfies F^{E) = 0. Then <C P^^ 
means that Forecaster II is "automatically reliable" : if £^ is chosen in advance 
and satisfies F^^{E) = 0, we also do not expect it to happen. Part [T] of Corollary 
[1] says that Forecaster II is automatically reliable if and only if the Forecasters' 
predictions are close in the sense of ([3]) holding almost surely w.r. to the reliable 
probability measure P'^. 

On the other hand, P^ _L P^^ means that Forecaster II is "automatically 
unreliable" : we can choose in advance an event E which we expect to happen 
{F\E) = 1) but which will falsify the probability measure P" (P"(£;) = 0). 
Part [2] of Corollary [1] says that Forecaster II is automatically unreliable if and 
only if the Forecasters' predictions are far apart in the sense of ([5]) holding 
almost surely w.r. to the reliable probability measure P^. 

The game-theoretic Theorem [Ta| has several advantages over Corollary [TJ 
First of all. Theorem [Ta| is "pointwise" : it carries information about specific 
plays of the game. It also has the flexibility provided by the game-theoretic 
framework in general: 

• the Forecasters can react to the Sceptics' moves (and Reality can react to 
both the Forecasters and the Sceptics); 

• all players can react to various events outside the protocol; 

• game-theoretic results about merging of opinions, unlike the standard 
measure-theoretic results about absolute continuity and singularity, do 
not depend on the (vast) parts of the space fl* that are never reached or 
even approached by Reality. 
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